Regularizing double machine learning in partially linear endogenous models

نویسندگان

چکیده

The linear coefficient in a partially model with confounding variables can be estimated using double machine learning (DML). However, this DML estimator has two-stage least squares (TSLS) interpretation and may produce overly wide confidence intervals. To address issue, we propose regularization selection scheme, regsDML, which leads to narrower It selects either the TSLS or regularization-only depending on whose variance is smaller. tailored have low mean squared error. regsDML fully data driven. converges at parametric rate, asymptotically Gaussian distributed, equivalent estimator, but exhibits substantially better finite sample properties. uses idea of k-class estimators, show how estimation combined estimate endogenous model. Empirical examples demonstrate our methodological theoretical developments. Software code for method available R-package dmlalg.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Differenced-Based Double Shrinking in Partial Linear Models

Partial linear model is very flexible when the relation between the covariates and responses, either parametric and nonparametric. However, estimation of the regression coefficients is challenging since one must also estimate the nonparametric component simultaneously. As a remedy, the differencing approach, to eliminate the nonparametric component and estimate the regression coefficients, can ...

متن کامل

Persistence Diagrams with Linear Machine Learning Models

Persistence diagrams have been widely recognized as a compact descriptor for characterizing multiscale topological features in data. When many datasets are available, statistical features embedded in those persistence diagrams can be extracted by applying machine learnings. In particular, the ability for explicitly analyzing the inverse in the original data space from those statistical features...

متن کامل

Sparse Partially Linear Additive Models

The generalized partially linear additive model (GPLAM) is a flexible and interpretable approach to building predictive models. It combines features in an additive manner, allowing them to have either a linear or nonlinear effect on the response. However, the assignment of features to the linear and nonlinear groups is typically assumed known. Thus, to make a GPLAM a viable approach in situatio...

متن کامل

Double Deep Machine Learning

Very important breakthroughs in data-centric machine learning algorithms led to impressive performance in ‘transactional’ point applications such as detecting anger in speech, alerts from a Face Recognition system, or EKG interpretation. Nontransactional applications, e.g. medical diagnosis beyond the EKG results, require AI algorithms that integrate deeper and broader knowledge in their proble...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Electronic Journal of Statistics

سال: 2021

ISSN: ['1935-7524']

DOI: https://doi.org/10.1214/21-ejs1931